ZeuScansion: a tool for scansion of English poetry

نویسندگان

  • Manex Agirrezabal
  • Bertol Arrieta
  • Aitzol Astigarraga
  • Mans Hulden
چکیده

We present a finite state technology based system capable of performing metrical scansion of verse written in English. Scansion is the traditional task of analyzing the lines of a poem, marking the stressed and non-stressed elements, and dividing the line into metrical feet. The system’s workflow is composed of several subtasks designed around finite state machines that analyze verse by performing tokenization, part of speech tagging, stress placement, and unknown word stress pattern guessing. The scanner also classifies its input according to the predominant type of metrical foot found. We also present a brief evaluation of the system using a gold standard corpus of human-scanned verse, on which a per-syllable accuracy of 86.78% is reached. The program uses open-source components and is released under the GNU GPL license.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Machine Learning for Metrical Analysis of English Poetry

In this work we tackle the challenge of identifying rhythmic patterns in poetry written in English. Although poetry is a literary form that makes use standard meters usually repeated among different authors, we will see in this paper how performing such analyses is a difficult task in machine learning due to the unexpected deviations from such standard patterns. After breaking down some example...

متن کامل

A Comparison of Feature-Based and Neural Scansion of Poetry

Automatic analysis of poetic rhythm is a challenging task that involves linguistics, literature, and computer science. When the language to be analyzed is known, rule-based systems or data-driven methods can be used. In this paper, we analyze poetic rhythm in English and Spanish. We show that the representations of data learned from character-based neural models are more informative than the on...

متن کامل

Metrical Annotation of a Large Corpus of Spanish Sonnets: Representation, Scansion and Evaluation

In order to analyze metrical and semantics aspects of poetry in Spanish with computational techniques, we have developed a large corpus annotated with metrical information. In this paper we will present and discuss the development of this corpus: the formal representation of metrical patterns, the semi-automatic annotation process based on a new automatic scansion system, the main annotation pr...

متن کامل

Assigning stress to out-of-vocabulary words: three approaches

In this paper we address the task of automatically assigning primary stress to out-of-vocabulary words in English. This work forms a necessary component in a scansion system for English poetry. We propose three different approaches based on (1) word similarity, (2) handwritten linguistic rules and (3) machine learning. The first and last approach require stress-annotated corpora to train a mode...

متن کامل

Translating across Cultures: Yi Jing and Understanding Chinese Poetry

Translating across cultures stands for a complicated and demanding process. In his well-known article “Chinese Poetry and the English Reader,” David Hawkes discussed the challenges of translating classical Chinese poetry into English, but he failed to examine a most distinctive feature of Chinese poetry, yi jing 意境. Commonly translated as “poetic world,” yi jing is as much a cultural and philos...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • J. Language Modelling

دوره 4  شماره 

صفحات  -

تاریخ انتشار 2013